Semantic Dictionary Encoding
falvotech.com·4h·
Discuss: Hacker News
💾Binary Formats
LAVa: Layer-wise KV Cache Eviction with Dynamic Budget Allocation
arxiv.org·15h
🧠LLM Inference
StringWa.rs on GPUs: Databases & Bioinformatics 🦠
ashvardanian.com·13m·
Discuss: r/programming
🗂️Vector Indexes
Language Models Pack Billions of Concepts into 12,000 Dimensions
nickyoder.com·15h·
🎯Vector Quantization
Conquering the LLM Memory Wall: How to Run 2–4x Longer Contexts with a Single Line of Code
reddit.com·7h·
Discuss: r/LocalLLaMA
🧠LLM Inference
Crashes are loud. Leaks are quiet.
blog.bitdrift.io·19h
💾Persistence Strategies
Baking with Rails at scale: recipes in Ruby, cookware from Go, C, and Rust
evilmartians.com·19h
🏹Apache Arrow
Analog IMC Attention Mechanism For Fast And Energy-Efficient LLMs (FZJ, RWTH Aachen)
semiengineering.com·2h
🧠LLM Inference
What is Algebraic about Algebraic Effects?
interjectedfuture.com·3h
💻Programming languages
CoDiCodec: Unifying Continuous and Discrete Compressed Representations of Audio
arxiv.org·15h
🗜️Zstd
When Open Source Isn't Good Enough
unstructured.io·19h
🔓Open Source Software
UTF-8 Is Beautiful
hackaday.com·14h
📋Markdown
Basic Guide to Einsum
ajcr.net·23h·
Discuss: Hacker News
🔄SIMD Programming
Balance between refactoring and inheritance in your code
github.com·7h·
Discuss: Hacker News
🪄Prompt Engineering
A Slotted Hash Cons for Alpha Invariance
philipzucker.com·39m·
Discuss: Hacker News
📑Inverted Indexes
A Dumb Introduction to z3. Exploring the world of constraint solvers with very simple examples.
asibahi.github.io·21h·
🧮SMT Solvers
Spectral Bottleneck in Deep Neural Networks: Noise is All You Need
arxiv.org·15h
🔢BitNet
Vibe Check: GPT-5 Codex Can Code for 35 Minutes Straight—If You Ask Nicely
kill-the-newsletter.com·1h
🪄Prompt Engineering
Weighted random generation in Python (2010)
eli.thegreenplace.net·21h·
Discuss: Hacker News
📑Inverted Indexes
Valuable News – 2025/09/15
vermaden.wordpress.com·10h
📡RSS